Side-chain and backbone ordering in a Polypeptide 

Yanjie Wei 

Department of Physics, Michigan Technological University, Houghton, MI 49931, USA 

\-^'. Walter Nadler 

O, 

O . Department of Physics, Michigan Technological University, Houghton, MI 49931, USA 

(N 

> . 

Q . Ulrich H.E. Hansinann 

'^ ' Department of Physics, Michigan Technological University, Houghton, MI 49931, USA and 

^ ' John-von-Neumann Institute for Computing, 

r-| ' Forschungszentrum Jiilich, D-52425 Jiilich, Germany 

(Dated: February 2, 2008) 

Abstract 

.^ , We report results from niulticanonical simulations of polyglutamic acid chains of length ten 

t^' residues. For this simple polypeptide we observe a decoupling of backbone and side-chain ordering 

, *^ - in the folding process. While the details of the two transitions vary between the peptide in gas phase 

and in an implicit solvent, our results indicate that, independent of the specific surroundings, upon 

>, 

OO \ continuously lowering the temperature side-chain ordering occurs only after the backbone topology 

OO 
^^ ' is completely formed. 

o 
> 

X 



I. INTRODUCTION 

The folding of a protein into its biologically-active structure involves a number of struc- 
tural transitions. Examples are the collapse into an ensemble of compact configurations, 
the formation of secondary structure elements such as a elices or /? heets, or the ordering 
of side chains. The role of these transitions in the folding process and their thermal order- 
ing are still only poorly understood. Computer experiments offer one way to study these 
questions but are often hampered by poor convergence of the simulations^ . Only with the 
development of parallel tempering^i^i^, multicanonical sampling5.i£, and other generalized en- 
semble techniques^ has it become possible to sample efficiently in all-atom models low-energy 
configurations of proteins with up to ~ 50 residues^. This is especially important for inves- 
tigations of side-chain ordering where one cannot fall back to the use of lattice proteins and 
other minimal protein models that usually ignore side chains. For this reason, we have used 
multicanonical sampling™ which was first introduced to protein science in Ref. |9| to study 
the role of side- chain ordering in the folding process. In order to simplify the problem we 
have ignored the effects of sequence heterogeneity, i.e. side chains of different size. We have 
focused on a homopolymer, polyglutamic acid, as this amino acid has very long side chains 
that can interact through hydrogen bonds. In that way one can expect that the effects of 
side chains on the folding process are maximized. Chains of a length of ten residues were 
simulated both in gas phase and with an implicit solvent. In both cases we observed a two- 
step process that, upon lowering the temperature, starts with a coordinated helix-formation 
and collapse of the polypeptide chain. Only at much lower temperatures, after the backbone 
is fully organized, we find ordering of the side chains. The arrangement of the side chains 
is due to the competition between the interactions among the side chains themselves and 
the interactions between them and the surrounding environment. Consequently, the effect 
of side-chain ordering is weaker for the solvated molecule than for the one in gas phase as 
the solvent screens the interside- chain interactions. However, the two-step process itself 
is independent of the specific model indicating that the de-coupling of backbone and side- 
chain ordering does not depend on the details of the surroundings and, therefore, could be 
a common characteristic in protein folding. 



II. METHODS 

Our investigations rely on simulations of GIuiq with the ECEPP/3 force field^ as im- 
plemented in the 2005 version of the program package SMMP— »i^. Here the interactions 
between the atoms within the homopolymer chain are approximated by a sum Eecepp/3 
consisting of electrostatic energy Ec, a Lennard- Jones term Eu, hydrogen-bonding term 
Ehb and a torsion energy E^or'- 

-£'ECEPP/3 = Ec + Elj + Ehb + Etor 
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where r^j is the distance between the atoms i and j, ^i is the Ith torsion angle, and energies 
are measured in kcal/mol. The protein-solvent interactions are approximated by a solvent 
accessible surface term 

Esolv = 22 ^*^« ■ (^) 

i 

The sum goes over the solvent accessible areas Ai of all atoms i weighted by solvation 
parameters ctj as determined in Ref. Il3|, a common choice when the ECEPP/3 force field 
is utilized. Note that E^oiv is a rather crude approximation of the interaction between the 
polypeptide and the surrounding water that is motivated by the low computational costs 
when compared to simulations with explicit water molecules. Because of that its reliability is 
not always clear—. Especially the temperature scale may be distorted (leading, for instance, 
to transitions at temperatures where in nature water would be vaporized). However, our 
previous experiences^!^ have shown that our energy function reproduces qualitatively the 
effects of protein-water interaction correctly. 

The evaluation of this detailed energy function is not only computationally more ex- 
pensive than that of minimal protein models but the competing interactions lead also to 
an energy landscape that is characterized by a multitude of minima separated by high 
energy barriers. As the probability to cross an energy barrier of height AE is given by 



ex]i{—AE / ksT) {ks the Boltzinann constant) it follows that extremely long runs are neces- 
sary to obtain sufficient statistics in regular canonical simulations at a low temperature T. 
Hence, in order to enhance sampling, we rely on the multicanonical approacb^i^ as described 
in Ref. l9l. Here, configurations are weighted with an iteratively determined term wmu{E) 
such that the probability distribution 

Pmu{E) oc n{E)wMu{E) ~ const , (3) 

where n{E) is the spectral density of the system. Thermodynamic averages < O > at 
temperature T are obtained by re-weighting^^: 

where x counts the configurations of the system. 

After determining the multicanonical weights wmu{E), we have performed multicanonical 
simulations of 5 x 10® sweeps. Each sweep consists of 70 Metropolis steps that try to update 
each of the 70 dihedral angles (the degrees of freedom in our system) once. Every 10 steps 
various quantities are measured and written to a file for further analysis. These include the 
energy E with its respective contributions from Eq. ([T]) and - in the case of the simulations 
in solvent - from the protein-solvent interaction energy Egoiv The radius of gyration Vgy 
is a measure of the geometrical size, and the number of helical residues Uh, i.e. residues 
where the pair of dihedral angles (0, ip) take values in the range (—70° ± 30°, —37° ± 30°)^^. 
Finally, we monitor the number of hydrogen bonds, uhb where we distinguish between 
hydrogen bonds along the backbone and hydrogen bonds between side chains. 

III. RESULTS AND DISCUSSIONS 

Our aim is to study the relationship between side-chain ordering and other transitions 
for the example of the homopolymer GIuiq. We first investigate the case of molecules in gas 
phase. Fig. [T] displays the specific heat per molecule 

C{T) = kBf3\< E^>-<E>^) (5) 

as a function of temperature. Two peaks are observed in this plot indicating two transitions. 

The first peak is located at a temperature Ti = 590 K and is relatively broad, with a 

half-width of about 160K. The corresponding plot of the helicity in Fig. [2] shows that this 



peak separates a high-temperature region where the backbone has no ordering from a region 
where temperatures are low enough to allow the formation of backbone hydrogen bonds (see 
the inlay to Fig. [2]) and subsequent growth of an a-helix. While the width of the specific 
heat peak is comparable to that of Alaio^^, the transition temperature here is considerably 
higher (Ti = 427 K for Alaio). Since the side chains of Glu are larger than in the case of 
Ala, they provide sterical hindrances to backbone conformations, leading to a decrease of the 
backbone entropy. As the transition is driven by entropy, this leads to a higher transition 
temperature. In general, it is not uncommon that transition temperatures in the gas phase 
are relatively higb^i and it has been verified experimentally that helices can be stable in the 
gas phase up to high temperatures^ii^. 

The helix-coil transition is also connected with a collapse of the molecule. This can be 
seen from the inlay of Fig. [1] where we display the average radius of gyration Tgy as a function 
of temperature. We observe a monotonic drop in the radius of gyration, starting already at 
higher temperatures but continuing through the transition regime as defined by the specific 
heat peak. Below the transition Vgy stabilizes, reflecting the stable helical structure that has 
been reached. 

The inlay also shows that the second peak in the speciflc temperature that is observed 
at a lower temperature T2 = 164 K in Fig. [1] is not related to the collapse of the molecule. 
Hence, it is reasonable to assume that this transition is related to an ordering of the side 
chains in our molecule. This hypothesis is supported by Fig. [3] where we display the average 
number nfj, of side-chain hydrogen bonds as function of temperature. The fluctuations of 
this quantity, x{T) = ( (n^i^ — (n^b)) ), are shown in the inlay. The change in the number 
of side-chain hydrogen-bonds that is observed in this flgure at T2 clearly shows that the 
corresponding peak in the speciflc heat indicates indeed a second transition that separates 
now a low-temperature phase with ordered side chains from a phase at temperatures above 
T2 where the backbone is ordered but the side chains are still fluctuating. The form of the 
side-chain ordering can be seen best from the lowest energy conformation displayed in Fig. HI 
Here, as also already described earlier in Ref. 120, the side chains nestle along the cylinder 
formed by the helix and are stabilized by the side chain hydrogen bonds (not shown in the 
flgure). Hence, our results indicate that Gluio "folds" in gas phase in a two-step process 
that - upon continuously lowering the temperature - starts with a concurrently occurring 
collapse and secondary structure formation. Only after the backbone geometry is flxed can 



the side chains ahgn themselves, too, in a second step. 

In nature, proteins are solvated and the details of the protein-solvent interaction are 
important for the structure and function of a protein. Hence, it is not clear whether our 
results obtained in gas phase apply also for solvated proteins. For this reason we have 
extended our investigation in a second step to that of solvated GIuiq. As in the case of the 
molecule in gas phase, we observe again two peaks in the specific heat (see Fig. [5]). The 
peak at the higher temperature Ti = 477 K marks again the collapse (see the inlay of Fig. [5]) 
and subsequent formation of an a-helix. The later can be seen from Fig. [6] where we display 
again the average helicity and average number of backbone hydrogen bonds. Note that the 
transition temperature is by more than 100 K lower than in the gas phase. Also, the peak in 
the specific heat is higher and narrower, indicating a sharper, more well-defined transition. 
Both features are actually known from earlier studies on polyalanine^ii^. The reason for the 
shift to a lower temperature is the competition between the formation of backbone hydrogen 
bonds that stabilize an a helix, and that of hydrogen bonds between the backbone and the 
solvent in the coil phase, the energetic contribution of the latter being described in a mean 
field way by the solvent energy term ([2]). While the transition in gas phase is driven solely 
by entropy, here also a part of the energy favors the coil phase. These effects collaborate so 
that the transition takes place at a lower temperature and becomes sharper. The sharper 
transition is also particularly visible in the radius of gyration, shown in the inlay of Fig. [51 In 
contrast to the gas phase, both quantities remain practically constant above the transition. 
At Ti, however, both show a sharp drop and remain practically constant again in the ordered 
phase. 

The actual value of the transition temperature is still higher than that of the corre- 
sponding value for Alai^ {T\ = 333 K). The absolute differences between the transition 
temperatures of both molecules, AlaiQ and GIuiq, are relatively similar, 163 K in vacuum 
and 144 K in solvent. The reasons discussed for this shift in vacuum appear to apply here 
in the solvent case, too: Lower backbone entropy due to larger side chains. We remark that 
experimental measurements would allow for an adjustment of the temperature scale which 
seems to be incorrect for the ECEPP/3 force field. 

The second peak in the specific heat that is observed in Fig. [5] at the lower temperature 
T2 = 111 K is more narrow and smaller than the corresponding one for the molecule in 
gas phase (Fig. [1]). As with the coil-helix transition temperature Ti, this transition is also 



shifted to lower temperatures in the solvent, albeit by the smaller amount of 53 K. As in the 
case of of Gluio in gas phase, the side chains are ordered at temperatures below T2. This can 
be seen for the example of the lowest energy configuration that was found in our simulation 
of the solvated Gluio molecule. This structure is shown in Fig. [7| and is characterized by 
side chains that order themselves by extending into the solvent. This is in strong contrast 
to the gas phase, where the the side chains nestle along the helical cylinder and the ordering 
results from hydrogen bond formation between the side chains. Practically no side-chain 
hydrogen bonds are observed in the low temperature phase ((^fh) < 0.04) for the solvated 
molecule. 

No correlation is observed between the number of side-chain hydrogen bonds and the 
peak of the specific heat at T2 in Fig. O Hence, the mechanism by which the side chains 
order themselves has to be different in water from in gas phase. The radial orientation of 
the side chains in the lowest energy configuration of Fig. [7] suggest that they try to enhance 
exposure to water. As Gluio is a hydrophilic molecule such behavior would be reasonable as 
it would decrease the solvation energy. Although no pronounced decrease in the solvation 
energy can be seen around T2, the fluctuations of the solvation energy show a pronounced 
increase, see Fig. [HI Note that in this figure the fluctuations are normalized by temperature, 
< SE"^ > /kBT"^, to allow a comparison with the specific heat ([5]). These solvent energy 
fluctuations at T2 are actually even larger than at the helix-coil transition. There the energy 
fluctuations and, hence, the specific heat are dominated by the fluctuations of the internal 
energy. In addition, at the helix-coil transition the fluctuations of both energy contributions 
exhibited a peak at the transition, together with a negative peak of the cross correlation, the 
latter denoting that the fluctuations of internal and solvent energies are anti-correlated. In 
contrast, around T2 we observe an increase of the solvent energy fluctuations that continues 
to lower temperatures, while the fluctuations of the internal energy show a plateau and 
decrease below T2. The peak in the specific heat around T2 is actually due to increasing 
anticorrelations below T2. The latter reflects the fact that in the side-chain ordered phase 
fluctuations in the solvent energy are countered by corresponding negative fluctuations in 
the internal energy. 

Hence, while the structure of the solvated molecule also evolves in a two-step process 
upon continuously lowering the temperature, the mechanism that leads to the second step, 
the side-chain ordering, is different from the gas phase. There the ordering of side chains 



was due to formation of hydrogen bonds between the side chains. However, in water, the 
polar side chains interact directly with the surrounding water. Water screens them from 
forming hydrogen bonds among each other. 

IV. SUMMARY AND OUTLOOK 

Our results indicate that Gluio "folding" in gas phase is a two-step process, starting - 
upon continuously lowering the temperature - with a collapse which becomes concurrent 
with the secondary structure formation. Only after the backbone geometry is fixed the side 
chains can align themselves along the helical cylinder, too, stabilizing themselves by forming 
hydrogen bonds with each other. 

In solvent, Gluio "folds" also in a two-step process upon continuously lowering the tem- 
perature. However, in contrast to the above scenario, the collapse is concurrent with the 
secondary structure formation that exhibits a much sharper transition than in vacuum. 
Side-chain ordering takes place, too, but it has a different character. Side chains do not 
align themselves along the helical cylinder but rather, extend into the solvent, which screens 
them from forming hydrogen bonds among themselves. 

Our results indicate that the de-coupling of backbone and side-chain ordering does not 
depend on the details of the environment. Hence, it is reasonable to assume that this process 
could be a common characteristic in protein folding. In order to test this hypothesis, we 
are now looking at the dependence of side-chain ordering on size and chemical properties. 
Glutamine (Gin), for example, is about the size of glutamic acid and should also be able 
to participate in hydrogen bonds. Aspartic acid (Asp) and asparagine (Asn) are also able 
to form hydrogen bonds but have a smaller size, while lysine (Lys) has a larger polar side 
chain. Investigations of these molecules along the lines sketched in this contribution will 
add more detailed knowledge to side chain ordering in polypeptides. 
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Figure captions: 



Fig. 1 Specific heat C{T) as function of temperature T for Gluio in gas phase as obtained 
from a muhicanonical simulation with 5 x 10^ sweeps. The inlay shows the average 
radius of gyration < Vgy > (T) . 

Fig. 2 Average number of helical residues < nu > (T) as function of temperature T for 
Gluio in gas phase as obtained from a multicanonical simulation with 5 x 10^ sweeps. 
The inlay shows the corresponding average number < nf^ > (T) of backbone hydrogen 
bonds as function of temperature T. 

Fig. 3: Average number of side chain hydrogen bonds < nf^ > (T) as function of tem- 
perature T for Gluio in gas phase as obtained from a multicanonical simulation with 
5 X 10^ sweeps. The inlay shows the corresponding fluctuation x(T) as function of 
temperature T. 

Fig. 4: Lowest energy configuration of Gluio in gas phase as obtained from a multicanonical 
simulation with 5 x 10^ sweeps and subsequent minimization. The picture has been 
obtained with PYMOL. 

Fig. 5: Specific heat C{T) as function of temperature T for solvated Gluio as obtained 
from a multicanonical simulation with 5 x 10^ sweeps. The inlay shows the average 
radius of gyration < Vgy > (T) . 

Fig. 6 Average number of helical residues < uh > (T) as function of temperature T for 
the solvated Gluio as obtained from a multicanonical simulation with 5 x 10^ sweeps. 
The inlay shows the corresponding average number < nf^ > (T) of backbone hydrogen 
bonds as function of temperature T. 

Fig. 7: Lowest energy configuration of the solvated Gluio as obtained from a multicanonical 
simulation with 5 x 10^ sweeps and subsequent minimization. The picture has been 
obtained with PYMOL. 

Fig. 8: Fluctuations of the solvation energy < SE'^^i^ > (T) and intramolecular energy 
< SE'^cEPP/3 ^ (-^) f*-*^ solvated Gluio as obtained from a multicanonical simulation 
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with 5 X 10^ sweeps. In addition, the cross-correlation < SEsoIvSEecepp/3 > (T) is 
shown. Note that 5E = E- < E >. 
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